Search CORE

23 research outputs found

Matching Theory and Data with Personal-ITY:What a Corpus of Italian YouTube Comments Reveals About Personality

Author: Bassignana Elisa
Nissim Malvina
Patti Viviana
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2020
Field of study

Proceedings - University of Groningen

Personal-ITY:A Novel YouTube-based Corpus for Personality Prediction in Italian

Author: Bassignana Elisa
Nissim Malvina
Patti Viviana
Publication venue: CEUR-WS.org
Publication date: 01/01/2020
Field of study

ARTS repository - University of Groningen

Matching Theory and Data with Personal-ITY:What a Corpus of Italian YouTube Comments Reveals About Personality

Author: Bassignana Elisa
Nissim Malvina
Patti Viviana
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2020
Field of study

ARTS repository - University of Groningen

Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality

Author: Bassignana Elisa
Nissim Malvina
Patti Viviana
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2020
Field of study

As a contribution to personality detection in languages other than English, we rely on distant supervision to create Personal-ITY, a novel corpus of YouTube comments in Italian, where authors are labelled with personality traits. The traits are derived from one of the mainstream personality theories in psychology research, named MBTI. Using personality prediction experiments, we (i) study the task of personality prediction in itself on our corpus as well as on TwiSty, a Twitter dataset also annotated with MBTI labels; (ii) carry out an extensive, in-depth analysis of the features used by the classifier, and view them specifically under the light of the original theory that we used to create the corpus in the first place. We observe that no single model is best at personality detection, and that while some traits are easier than others to detect, and also to match back to theory, for other, less frequent traits the picture is much more blurred.Comment: 12 pages, Accepted at PEOPLES 2020 (workshop COLING 2020). arXiv admin note: text overlap with arXiv:2011.0568

arXiv.org e-Print Archive

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

The IT University of Copenhagen's Repository

Dissertations of the University of Groningen

Matching Theory and Data with Personal-ITY:What a Corpus of Italian YouTube Comments Reveals About Personality

Author: Bassignana Elisa
Nissim Malvina
Patti Viviana
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2020
Field of study

University of Groningen

Personal-ITY:A Novel YouTube-based Corpus for Personality Prediction in Italian

Author: Bassignana Elisa
Nissim Malvina
Patti Viviana
Publication venue: CEUR-WS.org
Publication date: 01/01/2020
Field of study

We present a novel corpus for personality prediction in Italian, containing a larger number of authors and a different genre compared to previously available resources. The corpus is built exploiting Distant Supervision, assigning Myers-Briggs Type Indicator (MBTI) labels to YouTube comments, and can lend itself to a variety of experiments. We report on preliminary experiments on Personal-ITY, which can serve as a baseline for future work, showing that some types are easier to predict than others, and discussing the perks of cross-dataset prediction

arXiv.org e-Print Archive

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

OpenEdition

Institutional Research Information System University of Turin

Dissertations of the University of Groningen

Hurtlex: A multilingual lexicon of words to hurt

Author: Basile Valerio
Bassignana Elisa
Patti Viviana
Publication venue: CEUR-WS
Publication date: 01/01/2018
Field of study

Institutional Research Information System University of Turin

Personal-ITY:A Novel YouTube-based Corpus for Personality Prediction in Italian

Author: Bassignana Elisa
Nissim Malvina
Patti Viviana
Publication venue: CEUR-WS.org
Publication date: 01/01/2020
Field of study

OpenEdition

Dissertations of the University of Groningen

Hurtlex: A Multilingual Lexicon of Words to Hurt

Author: Basile Valerio
Bassignana Elisa
Patti Viviana
Publication venue: 'OpenEdition'
Publication date: 01/01/2018
Field of study

We describe the creation of HurtLex, a multilingual lexicon of hate words. The starting point is the Italian hate lexicon developed by the linguist Tullio De Mauro, organized in 17 categories. It has been expanded through the link to available synset-based computational lexical resources such as MultiWordNet and BabelNet, and evolved in a multi-lingual perspective by semi-automatic translation and expert annotation. A twofold evaluation of HurtLex as a resource for hate speech detection in social media is provided: a qualitative evaluation against an Italian annotated Twitter corpus of hate against immigrants, and an extrinsic evaluation in the context of the AMI@Ibereval2018 shared task, where the resource was exploited for extracting domain-specific lexicon-based features for the supervised classification of misogyny in English and Spanish tweets.L’articolo descrive lo sviluppo di Hurtlex, un lessico multilingue di parole per ferire. Il punto di partenza è il lessico di parole d’odio italiane sviluppato dal linguista Tullio De Mauro, organizzato in 17 categorie. Il lessico è stato espanso sfruttando risorse lessicali sviluppate dalla comunità di Linguistica Computazionale come MultiWordNet e BabelNet e le sue controparti in altre lingue sono state generate semi-automaticamente con traduzione ed annotazione manuale di esperti. Viene presentata sia un’analisi qualitativa della nuova risorsa, mediante l’analisi di corpus di tweet italiani annotati per odio nei confronti dei migranti e una valutazione estrinseca, mediante l’uso della risorsa nell’ambito dello sviluppo di un sistema Automatic Misogyny Identification in tweet in spagnolo ed inglese

Crossref

OpenEdition

MULTI-CROSSRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction

Author: Bassignana Elisa
Ginter Filip
Goot Rob van der
Plank Barbara
Pyysalo Sampo
Publication venue: University of Tartu Library
Publication date: 01/05/2023
Field of study

DSpace at Tartu University Library